Processing DNA molecules as text
نویسندگان
چکیده
منابع مشابه
TICCLops: Text-Induced Corpus Clean-up as online processing system
We present the ‘online processing system’ version of Text-Induced Corpus Clean-up, a web service and application open for use to researchers. The system has over the past years been developed to provide mainly OCR error post-correction, but can just as fruitfully be employed to automatically correct texts for spelling errors, or to transcribe texts in an older spelling into the modern variant o...
متن کاملParallel Text Mining for Large Text Processing
There is an urgent need to develop new text mining solutions using High Performance Computing (HPC) and grid environments to tackle the exponential growth in textual data. Problem sizes are increasing by the day by addition of new text documents. Therefore the aim of this work is to lay the foundations for mining large text datasets (i.e. full text articles) in reasonable timeframes. The task o...
متن کاملLightweight Structured Text Processing
Text is a popular storage and distribution format for information, partly due to generic text-processing tools like Unix grep and sort. Unfortunately, existing generic tools make assumptions about text format (e.g., each line is a record) that limit their applicability. Custom-built tools are one alternative, but they require substantial time investment and programming expertise. We describe a ...
متن کاملFrames-based Text Processing
As part of a larger project to develop an intelligent noticing system, I am designing a module to process textual material. The essential tasks of a text processor can be divided into two operations: 1) Locating a prior context, called a theme, in the story database in which to place new knowledge. I shall call this process Linking; and 2) Mapping the new information in a sentence into that con...
متن کاملText Processing for Classification
These days textual information becomes increasingly available through the Web. This makes text an attractive resource from which to mine knowledge. The major difficulty in mining textual data is that the information is unstructured. Hence the data has to be preprocessed first so as to obtain some form of structured data which is amenable to data mining techniques. This paper focuses on this pre...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Systems and Synthetic Biology
سال: 2010
ISSN: 1872-5325,1872-5333
DOI: 10.1007/s11693-010-9059-y